Search results for "data mining"
showing 10 items of 34 documents
Rings for Privacy: an Architecture for Large Scale Privacy-Preserving Data Mining
2021
This article proposes a new architecture for privacy-preserving data mining based on Multi Party Computation (MPC) and secure sums. While traditional MPC approaches rely on a small number of aggregation peers replacing a centralized trusted entity, the current study puts forth a distributed solution that involves all data sources in the aggregation process, with the help of a single server for storing intermediate results. A large-scale scenario is examined and the possibility that data become inaccessible during the aggregation process is considered, a possibility that traditional schemes often neglect. Here, it is explicitly examined, as it might be provoked by intermittent network connec…
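As a rough illustration of the secure-sum idea mentioned in this abstract, the sketch below shows the classic masked secure sum over parties arranged in a ring. It is not the architecture proposed in the article: the function name `ring_secure_sum`, the modulus, and the single-process simulation of message passing are all assumptions made for illustration, and the sketch does not handle parties that drop out or the intermediate-result server the abstract describes.

```python
import secrets

# Assumed modulus for the arithmetic; it must exceed any possible true sum.
MODULUS = 2**61 - 1


def ring_secure_sum(private_values, modulus=MODULUS):
    """Simulate a masked secure sum over parties arranged in a ring.

    Hypothetical helper: each element of private_values stands for the
    private input of one party. Message passing is simulated by a loop.
    """
    if not private_values:
        raise ValueError("need at least one party")
    # The initiator draws a random mask that only it knows.
    mask = secrets.randbelow(modulus)
    running = mask
    # Each party adds its private value to the running total it receives,
    # so it only ever sees a value blinded by the initiator's mask.
    for value in private_values:
        running = (running + value) % modulus
    # The total returns to the initiator, which removes its mask.
    return (running - mask) % modulus


if __name__ == "__main__":
    print(ring_secure_sum([3, 5, 7]))
```

In this simple form, two colluding neighbours can still recover the value of the party between them, which is one reason practical schemes add further protections.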
Fragments of peer review: A quantitative analysis of the literature (1969-2015)
2018
This paper examines research on peer review between 1969 and 2015 by looking at records indexed in the Scopus database. Although it is often argued that peer review has been poorly investigated, we found that the number of publications in this field doubled from 2005. Half of this work was indexed as research articles, a third as editorial notes and literature reviews, and the rest as book chapters or letters. We identified the most prolific and influential scholars, the most cited publications and the most important journals in the field. Co-authorship network analysis showed that research on peer review is fragmented, with the largest group of co-authors including only 2.1% of the wh…
Unlock ways to share data on peer review
2020
Peer review is the defining feature of scholarly communication. In a 2018 survey of more than 11,000 researchers, 98% said that they considered peer review important or extremely important for ensuring the quality and integrity of scholarly communication.
Reverse-safe data structures for text indexing
2021
We introduce the notion of reverse-safe data structures. These are data structures that prevent the reconstruction of the data they encode (i.e., they cannot be easily reversed). A data structure D is called z-reverse-safe when there exist at least z datasets with the same set of answers as the ones stored by D. The main challenge is to ensure that D stores as many answers to useful queries as possible, is constructed efficiently, and has size close to the size of the original dataset it encodes. Given a text of length n and an integer z, we propose an algorithm which constructs a z-reverse-safe data structure that has size O(n) and answers pattern matching queries of length at most d optim…
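The z-reverse-safe definition in this abstract can be checked by brute force on tiny inputs. The sketch below is not the algorithm proposed in the paper: it assumes that the answer to a pattern matching query is the pattern's occurrence count, and the helper names (`substring_counts`, `count_consistent_texts`, `is_z_reverse_safe`) are hypothetical. It enumerates every candidate text, so it is only usable for very short texts.

```python
from itertools import product


def substring_counts(text, d):
    """Return the occurrence count of every substring of length 1..d."""
    counts = {}
    for length in range(1, d + 1):
        for start in range(len(text) - length + 1):
            piece = text[start:start + length]
            counts[piece] = counts.get(piece, 0) + 1
    return counts


def count_consistent_texts(text, d):
    """Count texts whose answers to all queries of length <= d match text.

    For d >= 1 the single-character counts already fix both the alphabet
    and the length, so only same-length texts over the same alphabet
    need to be enumerated.
    """
    alphabet = sorted(set(text))
    target = substring_counts(text, d)
    return sum(
        1
        for candidate in product(alphabet, repeat=len(text))
        if substring_counts("".join(candidate), d) == target
    )


def is_z_reverse_safe(text, d, z):
    """True if at least z texts share the same answers (assumed count model)."""
    return count_consistent_texts(text, d) >= z


if __name__ == "__main__":
    print(count_consistent_texts("abab", 1))
    print(count_consistent_texts("abab", 2))
```

Under this assumed query model, raising d makes more queries answerable but leaves fewer texts consistent with the stored answers, which is the trade-off the abstract describes.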
Sensor Mining for User Behavior Profiling in Intelligent Environments
2011
The proposed system exploits sensor mining methodologies to profile user behavior patterns in an intelligent workplace. The work is based on the assumption that users’ habit profiles are implicitly described by sensory data, which explicitly show the consequences of users’ actions on the state of the environment. Sensor data are analyzed to infer relationships of interest between environmental variables and the user, thereby detecting behavior profiles. The system is designed for a workplace equipped in the context of Sensor9k, a project carried out at the Department of Computer Science of Palermo University.
Medical news aggregation and ranking of taking into account the user needs
2019
The purpose of this work is to develop an intelligent information system for aggregating and ranking news according to the needs of the user. The online mass-media market is analyzed, together with the needs of readers, the purposes of their searches, and the situations in which finding the news is difficult. A conceptual model of the news aggregation and ranking system is constructed to present how the future intelligent information system will work and to show its structure. The methods and means for implementing the intelligent information system are selected. An online resource for aggregation and ranking of news, news feeds and flexible setting…
Blended Learning als Spielfeld für Learning Analytics und Educational Data Mining
2020
The use of digital learning formats in blended learning thus offers opportunities in at least two areas. First, digital learning formats can directly benefit students' learning processes, improve their performance, and have positive effects on many other levels, such as motivation or self-concept. Second, digital learning formats generate a wealth of data in many forms. When working with digital tools, students produce usage data such as dwell times and activity profiles, generate performance data from digital assignments, and leave text contributions in forums and chats. All of these data can be used to…
The Urban Landscape and the Real Estate Market. Structures and Fragments of the Axiological Tessitura in a Wide Urban Area of Palermo
2016
The proposed study deals with the urban landscape of Palermo and its possible representation from the perspective of real estate market analysis. Real estate is one of the most significant types of capital asset, and the wide range of its possible uses makes the interpretation of market phenomena complex. The multi-layered reality of such a large city (represented through the sample of 500 properties) needs to be articulated into a significant set of sub-markets in order to outline the complexity and to map the distribution of homogeneous groups of properties within the whole city area. The comparison between quality and price within each cluster allows us to elicit the degre…
The Three Steps of Clustering In The Post-Genomic Era
2013
This chapter describes the basic algorithmic components that are involved in clustering, with particular attention to classification of microarray data.
A Web Application for Interactive Visualization of European Basketball Data
2020
The statistical analysis of basketball games is a fast-growing field. Certainly, basketball data are scientifically relevant because an appropriate analysis provides a great deal of information about the performance of both players and teams. The number of games played each season generates a large amount of data worth analyzing. Basketball analytics is well established in U.S. leagues. In Europe, however, it has not been duly developed. This study focuses on the top three European team competitions: the EuroLeague, the EuroCup, and the Spanish ACB (Association of Basketball Clubs, acronym in Spanish) league. Their official websites provide access to game data for anyone who is interested, …